Temporal Mining Algorithms: Generalization and Performance Improvements
نویسندگان
چکیده
Temporal Mining Algorithms: Generalization and Performance Improvements Data mining consists of finding interesting trends or patterns in large datasets, in order to guide decisions about future activities. There is a general expectation that data mining tools should be able to identify these patterns in the data with minimal user input. The patterns identified by such tools can give a data analyst useful and unexpected insights that can be more carefully investigated subsequently. The most commonly sought patterns are association rules, that identify a frequently occurring pattern of information in the database. In the first part of research we study the problem of mining clustered association rules. The clustered and the quantitative Association Rules are useful in the context of mining rules over quantitative attributes. Since data used in data mining algorithms is usually temporal, it is very important to discover correlations of attributes over several snapshots. Information like this may affect decisions made in different areas of the business world. We study such problems as: the problem of discovering trend dependencies in temporal data, and temporal sequences mining. The discovered dependencies can be useful for many applications, including: creating special packages of promotions and sales based on customers behavior prediction, creating compact statistical information, and more. In the second part of research we propose some new approaches for mining temporal rules, based on trend dependencies discovery. Several extensions of trend dependency mining algorithms are presented in this thesis, in particular the multi-relational trend dependency mining. Algorithms with proofs of correctness and completeness are given. We also change the definition of support for a trend dependency. The algorithm can be used for mining trend dependencies of different types with variable number of relations, thus it is more general than previous approaches.
منابع مشابه
Improving the Performance of ICA Algorithm for fMRI Simulated Data Analysis Using Temporal and Spatial Filters in the Preprocessing Phase
Introduction: The accuracy of analyzing Functional MRI (fMRI) data is usually decreases in the presence of noise and artifact sources. A common solution in for analyzing fMRI data having high noise is to use suitable preprocessing methods with the aim of data denoising. Some effects of preprocessing methods on the parametric methods such as general linear model (GLM) have previously been evalua...
متن کاملDiscovering Temporal Relation Rules Mining from Interval Data
In this paper, we propose a new data mining technique that can address the temporal relation rules of temporal interval data by using Allen’s theory. We present two new algorithms for discovering temporal relationships: one is to preprocess an algorithm for the generalization of temporal interval data and to transform timestamp data into temporal interval data; and the other is to use a tempora...
متن کاملMINING FUZZY TEMPORAL ITEMSETS WITHIN VARIOUS TIME INTERVALS IN QUANTITATIVE DATASETS
This research aims at proposing a new method for discovering frequent temporal itemsets in continuous subsets of a dataset with quantitative transactions. It is important to note that although these temporal itemsets may have relatively high textit{support} or occurrence within particular time intervals, they do not necessarily get similar textit{support} across the whole dataset, which makes i...
متن کاملA Data Mining Algorithm for Generalized Web Prefetching
Predictive Web prefetching refers to the mechanism of deducing the forthcoming page accesses of a client based on its past accesses. In this paper, we present a new context for the interpretation of Web prefetching algorithms as Markov predictors. We identify the factors that affect the performance of Web prefetching algorithms. We propose a new algorithm called WMo, which is based on data mini...
متن کاملPerformance evaluation of gang saw using hybrid ANFIS-DE and hybrid ANFIS-PSO algorithms
One of the most significant and effective criteria in the process of cutting dimensional rocks using the gang saw is the maximum energy consumption rate of the machine, and its accurate prediction and estimation can help designers and owners of this industry to achieve an optimal and economic process. In the present research work, it is attempted to study and provide models for predicting the m...
متن کامل